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CONTENT ADDRESSABLE MEMORY WITH LATCHING SENSE AMPLIFIER 
FIELD OF THE INVENTION 

[00011 The present invention relates generally to content addressable memory devices, and 
more particularly to data access operations within content addressable memory devices. 

BACKGROUND 

[00021 Content addressable memory (CAM) devices are often used in network switching and 
routing applications to determine forwarding destinations for data packets. A CAM device can 
be instructed to compare a selected portion of an incoming packet, typically a destination field 
within the packet header, with data values, called CAM words, stored in an associative storage 
array within the CAM device. If the destination field matches a CAM word, the CAM device 
generates a CAM index that identifies the location of the matching CAM word within the storage 
array, and asserts a match flag to signal the match. The CAM index is then typically used to 
index another storage array, either within or separate firom the CAM device, to retrieve a 
destination address or other routing information for the packet. 

[00031 The associative storage array of a CAM device, commonly referred to as a CAM array, 
is typically populated with CAM cells arranged m rows and columns. Precharged match lines 
are coupled to respective rows of the CAM cells, and bit line pairs and compare line pairs are 
coupled to respective columns of the CAM cells. Together, the bit line pairs form a data port for 
read/write access to address-selected rows of CAM cells, and the compare line pairs form a 
compare port for inputting comparand values to the CAM array during compare operations. The 
CAM cells themselves are specialized store-and-compare circuits each having a storage element 
to store a constituent bit of a CAM word and a compare circuit for comparing the stored bit with 
a comparand bit presented on the compare lines. In a typical arrangement, the compare circuits 
within the CAM cells of a given row are coupled in parallel to the match line for the row, with 
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each compare circuit switchably forming a discharge path to discharge the match line if the 
stored bit and comparand bit do not match. By this arrangement if any one bit of a CAM word 
does not match the corresponding bit of the comparand value, the match line for the row is 
discharged to signal the mismatch condition. If all the bits of the CAM word match the 
corresponding bits of the comparand value, the match line remains in its precharged state to 
signal a match. Because a comparand value is presented to all the rows of CAM cells in each 
compare operation, a rapid, massively parallel search for a matching CAM word is performed. 
[0004] Figure 1 illustrates a portion of a prior-art CAM device, including a CAM cell 101, 
sense amplifier 141, compare lines (CL and /CL), bit lines (B and /B), and match line (ML). TTie 
CAM cell includes a bi-stable storage element 1 03 having complementary data nodes D and /D 
formed by back-to-back-coupled inverters (i.e., a first inverter formed by drain-coupled 
transistors 107 and 11 1 and a second inverter formed by drain-coupled transistors 109 and 1 13). 
The complementary data nodes, D and /D, are coupled to first and second bit lines, B and /B, via 
pass-gate-configured transistors 123 and 125, respectively. During a read or write operation, a 
word line (WL) is activated (e.g., in response to a decoded address value) to switch on transistors 
123 and 125, thereby enabling the states of the bit lines, B and /B, to be driven by the 
complementary nodes of the storage element 103 or vice-versa. The bit lines are precharged to a 
supply voltage, Vs, by precharge circuits 165 and 167 such that, when the word line is activated 
in a data read operation, whichever of the complementary nodes of the storage element 103 is 
low will sink a current, Ipd, to pull down the corresponding bit line. 
[00051 The sense amplifier 141 includes first and second differential amplifier circuits, 143 
and 145, that amplify the difference between the voltages on the bit lines to generate outputs SI* 
and Sr, respectively. More specifically, differential amplifier circuit 143 includes transistors 
147 and 149 having gate terminals coupled to the bit lines B and /B, respectively, source 
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terminals coupled in common to the drain terminal of a sense-enable transistor 1 54. and drain 
terminals coupled to the drain terminals of load transistors 151 and 153, respectively, -^e 
source terminals of load transistors 151 and 153 are coupled to a supply voltage node, and the 
gate terminals of transistors 151 and 153 are coupled to one another in a current mirror 
configuration. The transistor 151 is coupled in a diode configuration (i.e., gate to drain) to 
establish a self-biased load resistance to pull up the drairi terminals of transistors 151 and 153. 
The drain terminal of transistor 149 forms a first stage output node, Sl^ of the sense amplifier 
141. By this arrangement, when a sense-amp-strobe signal, SAS, is asserted, a differential 
voltage developed on the bit lines, B and /B, will result in one of transistors 147 and 149 
conducting more current than the other, thereby pulling down the drain node of one of the 
transistors 147 and 149 more than the other, and causing the voltage at output node Sl^ to go up 
or down relative to the voltage at the drain of transistor 147. The differential amplifier 145 
includes transistors 155, 157, 159, 161 and 162 coupled in substantially the same mamier as 
transistors 147, 149, 151. 153 and 154 of differential amplifier 143, except that the load 
transistor 161 coupled to the /B bit line is coupled in a diode configuration, and the drain 
terminal of transistor 155 forms a first stage output node, ST, of the sense amplifier 141. Thus, 
as output node SI* goes up, output node SI" goes down, and vice-vers. Overall, the differential 
amplifiers 143 and 145 typically achieve a differential voltage gain (i.e., (Vsi. - Vs.-)/ (Vb-V/b)) 
of approximately 2 to 3. A third differential amplifier (i.e., second stage amplifier, not shown) is 
usually provided to fiarther amplify the first stage outputs. SI* and SI", to conventional logic 
levels. 

[00061 Still refming to Figure 1 . the compare ctait 105 includes transistors 1 15 and 1 1 9 
coupled in series between a match line, ML, and a ground node, and transistors 1 1 7 and 121 also 
coupled in series between the match line and ground node. Oate terminals of transistors 1 19 and 



Atty. Docket No. NLMI.P133 



-4- 



121 are coupled to compare lines /CL and CL, respecHvely, and gate tenninals of transistors 115 
and 117 are coupled to the complementary nodes, D and /D, of the storage element 103. In a 
compare operation, a comparand bit is driven onto fl.e compare lines in complementary fonn 
(i.e.. the comparand bit, C, being driven onto compare Une CL. and complement comparand bit, 
/C, driven onto compare line /CL), fliereby enabling a compare operation within the compare 
circuit 105. If the comparand bit does not match the data bit stored in the storage element, then 
eitatransistors 115 and 119 will both be switched on (i.e.,D=l. /CI) or transistors 117 and 
121 will both be switched on (i.e., /D=l, C=l), in both cases forming a path between the match 
line and ground to discharge the match line and signal the mismatch condition. If the comparand 
bit matches the databit, then at least one transistor in each transistor pair 1 15/1 19 and 1 17/121 
will be switched oft, U.ereby interrupting the path to ground. The match line is pulled up to a 
logic hi^ state by a match line precharge circuit, MPC, so that, if no other CAM cells coupled to 
the match line detect a mismatch co»lition, the match line will remain high to signal the maKi 
condition. 

100071 Figure 2 illustmtes typical data and control signals generated during a read operation 
within the prior-art CAM cell 101 and sense amplifier 141 of Figure 1 . Initially, a. time Tl, the 
woni line, WL, is asseried to enable the complementary nodes (D and /D) of the storage element 
103 to affect the states of the bit lines, B and IB. Assuming that a logic • 1 • is stored in the 
stomge element, then ID node will be low, thereby pulling the IB bit line low. In a typical 0.13 
. micrometer process, IB will be pulled down from its precharged level by approximately 100 
millivolts (100 mV), thereby establishing a lOOmV differential at the inputs of the sense 
amplifier 141 . At time T2, after /B has been pulled down, the sense-amp-strobe signal, SAS, is 
asscried to switch on the sense-enable transistors 1 54 and 1 62, thereby enabling the differential 
■plifiers 143 and 145 to generate an amplified differential output of approximately 300mV at 
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Sl^ sr. A second stage amplifier, not shown in Figure 1. further amplifies the differential 
output by a factor of 3 to approximately 900 mV; a voltage level sufficient to drive a logic gate 
such as a NAND gate or logic-level inverter. 

[0008] As process geometries shrink, a number of challenges are presented in the prior-art 
sense amplifierl41 and CAM cell 101, particularly in carrying out data read operations. As an 
initial matter, in processes having critical dimensions (CDs) of 0.13u or lower, threshold voltage 
mismatch (i.e., Vx mismatch) in the sense amplifier inputs (i.e., in transistor pairs 147/149 and 
155/157) increase significantly, effectively reducing the differential data signal available for 
amplification. Vt mismatches as high as 30mV have been observed, amounting to roughly 30% 
of the -lOOmV differential signal typically generated on the bit lines. B and /B. The loss of 
effective differential signal is compounded by actual differential signal loss due to pass gate 
leakage and junction leakage in transistors 123 and 125. Pass gate leakage (i.e., sub-threshold 
current flow through transistors 123 and 125) is particularly problematic as the total amount of 
leakage is dependent on the data pattern stored in the column of CAM cells coupled to sense 
amplifier 141. More specifically, the sub-threshold current flow through transistor 125 is a 
fimction of the potential difference developed between the transistor source and drain terminals, 
and is therefore considerably higher when node D is at a logic '0' level than when node D is at a 
logic ' 1 ' level. While the logic '0' pass gate leakage current is relatively small, upwards of 1 0 
mV of differential signal may be lost in a worst case scenario (e.g., when a logic '0' is stored on 
all data nodes coupled to a given bit line except the data node being read). Also, sub-threshold 
current flow in transistors 123 and 125 increases as device thresholds drop. Device thresholds, 
in turn, scale with process geometry so that loss of signaling margin due to pass gate leakage 
increases as process geometries shrink. Junction leakage is a discharge current flowing through 
the drain/source-to-substrate junction of the transistors 123 and 125. The total discharge current 
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is a function of the total number of CAM cells coupled to a given pair of bit lines and therefore 
increases with the dimension of the CAM array. As process geometries shrink, CAM cell arrays 
have grown increasingly dense, with modem CAM devices having thousands of CAM cells 
coupled to each bit line pair. Such devices exhibit a non-negligible junction leakage that further 
reduces the differential signal observed by the sense amplifier. As process geometries progress 
toward 100 nanometers and below, the combined effects of Vt mismatch, pass gate leakage and 
junction leakage increasingly prevent the prior-art sense amplifier 141 from generating reliable 
logic level outputs, thereby reducing device yield and increasing cost. Moreover, the differential 
signal generated on the bit lines is also a fimction of the supply voltage level, and is therefore 
expected to be forther reduced as supply voltages drop progressively from 1.0 to 0.9 to 0.65 volts 
and below. 

[00091 Another problem presented by the prior-art sense amplifier 141 and CAM cell 101 of 
Figure 1 relates to interference between concurrently executed compare and read operations. As 
noted above, separate access and compare ports are provided to the CAM array, thus enabling 
read and compare operations to be carried out simultaneously. One side-effect of data read 
operations; however, is that the pull-down current. Ipo, sunk by the logic '0' node of the storage 
element 103 produces a nonzero voltage on that node and therefore at the gate of the 
corresponding transistor (1 15 or 117) of the compare circuit 105. For example, if a logic '0' 
value is stored in the storage element 103 when a read operation is initiated (i.e.. D=0), the pull- 
down current. Ipd flowing through transistors 123 and 107 when the word line is activated causes 
avoltage to develop across transistor 107 and therefore at the output node, D. As shown in 
Figure 2. typical voltages observed at the output node are in the 200mV range and therefore 
substantially increase the sub-threshold leakage current through the transistor 115 of the compare 
circuit. While the increased leakage current in a single CAM cell is usually not enough to pull 
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down the match line, when multiplied by the number of similarly increased leakage currents in 
other CAM cells of the row being read, the match line may undesirably be pulled low even 
though the CAM word matches the comparand value.presented on the compare lines, thus 
resulting in a false mismatch indication. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[00101 The present invention is illustrated by way of example, and not by way of limitation, in 
the figures of the accompanying drawings and in which like reference numerals refer to similar 

elements and in which: 

Figure 1 illustrates a portion of a prior-art CAM device, including a CAM cell, sense 

amplifier, compare lines, bit lines, and match line; 

Figure 2 illustrates typical data and control signals generated during a read operation 
within the prior-art CAM cell and sense amplifier of Figure 1 ; 

Figure 3 illustrates a column of CAM cells and a latching sense amplifier according to an 

embodiment of the invention; 

Figure 4 is a timing diagram showing exemplary control and data signals during a read 
operation withm the sense amplifier and a CAM cell of Figure 3; 

Figure 5 is a timing diagram showing a potential misread due to a signal perturbation on 

the bit lines of Figure 3; 

Figure 6 illustrates an embodiment of a write driver that may be used in combination with 

the latching sense amplifier of Figure 3; 

Figure 7 illustrates exemplary control signal timing and corresponding bit line states in an 

embodiment according to Figure 6; 

Figure 8 illustrates a column of CAM cells arranged in multiple sense groups and 
corresponding multi-stage sense amplifier circuitry according to an embodiment of the invention; 

Figure 9 is a timing diagram showing exemplary control and data signals during a read 

operation within tiie circuitry of Figure 8; 

Figure 10 is a timing diagram showing exemplary control and data signals during a write 

operation within the circuitry of Figure 8 ; 
Atty. Docket No. NLMI.P 133 -9- 



Figure 1 1 illustrates an embodiment of an address decoder that may be used with a CAM 
array having CAM cells arranged in sense groups as shown in Figure 8; 

Figure 1 2 illustrates a latching sense amplifier and column of CAM cells according to an 

alternative embodiment of the invention; 

Figure 1 3 is a timing diagram showing exemplary control and data signals during a read 

operation within the circuit arrangement of Figure 12; 

Figure 14 illustrates an embodiment of a ternary CAM cell that may be used in place of 
the binary CAM cells of Figure 3, or to implement the CAM cells depicted in Figures 8 and 12; 

Figure 15 illustrates an embodiment of a quaternary CAM cell that may be used in place 
of the binary CAM cell of Figure 3, or to implement the CAM cells depicted in Figures 8 and 12; 

Figure 16 illustrates a dual bit line arrangement for reading and writing data to a column 
of ternary or quaternary CAM cells according to an embodiment of the invention; and 

Figure 17 illustrates a CAM device having latching sense amplifiers according to an 
embodiment of the invention. 
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DETAILED DESCRIPTION 

lOOll] In the following description and in the accompanying drawings, specific terminology 
and drawing symbols are set forth to provide a thorough understanding of the present invention. 
In some instances, the terminology and symbols may imply specific details that are not required 
to practice the invention. For example, the interconnection between circuit elements or circuit 
blocks may be shown or described as multi-conductor or single-conductor signal Unes. Each of 
the multi-conductor signal lines may alternatively be single-conductor signal lines, and each of 
the single-conductor signal lines may alternatively be multi-conductor signal lines. Signals and 
signaling paths shown or described as being single-ended may also be differential, and vice- 
versa. Similarly, signals described or depicted as having active-high or active-low logic levels 
may have opposite logic levels in alternative embodiments. As another example, circuits 
described or depicted as including metal oxide semiconductor (MOS) transistors may 
alternatively be implemented using bipolar technology or any other technology in which a 
signal-controlled current flow may be achieved. With respect to terminology, a signal is said to 
be "asserted" when the signal is driven to a low or high logic state (or charged to a high logic 
state or discharged to a low logic state) to indicate a particular condition. Conversely, a signal is 
said to be "deasserted" to indicate that the signal is driven (or charged or discharged) to a state 
other than the asserted state (including a high or low logic state, or the floating state that may 
occur when the signal driving circuit is transitioned to a high impedance condition, such as an 
open drain or open collector condition). A signal driving circuit is said to "output" a signal to a 
signal receiving circuit when the signal driving circuit asserts (or deasserts, if explicitly stated or 
indicated by context) the signal on a signal line coupled between the signal driving and signal 
receiving circuits. A signal line is said to be "activated" when a signal is asserted on the signal 
line, and "deactivated" when the signal is deasserted. Additionally, the prefix symbol "/" 
Atty. Docket No. NLMI.P133 -1 1- 



attached to signal names indicates that the signal is an active low signal (i.e., the asserted state is 
a logic low state). A line over a signal name (e.g.. ' < signal name > ') is also used to indicate an 
active low signal. The term "terminal" is used to mean a point of electrical comiection. The term 
"exemplary" is used to express but an example, and not a preference or requirement. 
[0012] In embodiments of the present invention, a latching sense amplifier is used to sense 
differential voltages generated on bit lines of a CAM array. The sense amplifier senses an initial 
differential voltage generated on the bit lines and, in response, latches corresponding logic-level 
signals on the bit lines themselves. By this operation, even a relatively small differential voltage 
on the bit lines is effectively amplified to logic levels by the latching sense amplifier, thereby 
avoiding many of the signal-loss problems that plague the prior-art sense amplifier arrangement 
of Figure 1, and providing headroom for smaller process geometries and lower supply voltages, 
hi one embodiment, the latching sense amplifier latches the bit lines to rail-to-rail differential 
potentials (e.g., supply voltage and ground) so that the pull down current used to establish the 
initial, relatively small-swing differential signals on the bit lines is quickly reduced to zero. 
Consequently, the compare-read interference problem described in reference to Figures 1 and 2 
may be avoided, enabling more reliable simultaneous read and compare operations to be 
performed. In other embodiments, multiple stages of latching sense amplifiers are used to 
segment the CAM-cell-coupled bit lines into multiple sets of shorter bit lines coupled to fewer 
CAM cells, thereby reducing the loss of differential signal amplitude that otherwise results fix>m 
pass gate leakage and junction leakage. 

r AM Array with Static Storage F.lements and Latching Sense AmpUfier 
[00131 Figure 3 illustrates a column of CAM cells 201,-201n and a latching sense amplifier 
233 according to an embodiment of the invention. Each of the CAM cells 201,-201n is coupled 
to a respective one of match lines 251 ,-251n (ML,-MLn) and a respective one of word lines 
Atty. Docket No. NLMI.P133 -12- 



261 r26lN (WL.-WLn). with the match lines 251 and word lines 261 extending across respective 
rows of CAM cells within the larger CAM array. Each of the CAM cells is also coupled to a 
common pair of bit lines 257 and 259 (i.e., B and IB, respectively) and to a common pair of 
compare lines 253 and 255 (i.e., CL and /CL, respectively). By this arrangement, data access 
operations and compare operations may be executed concurrently (i.e., at least partly overlapping 
in time) as data signals may be transmitted on the bit lines 257. 259 for read/write purposes while 
the compare lines 253, 255 are simultaneously driven with complementary comparand signals. 
In an alternative embodiment, the comparand signals and data signals may be time multiplexed 
onto a single pair of signal lines, for example, to reduce the number of signal lines extending 
across the columns of the CAM array. 

[00141 In the embodiment of Figure 3, each of the CAM cells 201 is abinary CAM cell that 
includes a compare circuit 207, static storage circuit 205, and access-enable transistors 229 and 
231 (i.e., pass gates). The static storage element may be any volatile or non-volatile storage 
element capable of storing a binary bit for an indefinite period (i.e., without requiring multiple 
refresh actions per second as in the case of dynamic random access memory) including, without 
limitation, a static random access memory (SRAM) cell, an electrically erasable programmable 
read only memory (EEPROM or flash EEPROM) cell, a thyristor-based storage device, and so 
forth. In the particular embodiment of Figure 3, the static storage circuit 205 is a bi-stable 
storage element implemented by back-to-back coupled inverters 209 and 21 1 (i.e., fonning a 
latch circuit). Inverter 209 includes drain-coupled transistors 213 and 217 and has an output that 
forms data node 218, and an input coupled to the output of inverter 211. Inverter 211 includes 
drain-coupled transistors 215 and 219 and has an output node that forms complement node 220, 
and an input coupled to the output of inverter 209. The data node 21 8 has a state that 
corresponds to the data bit (D) stored within the storage circuit 205, and the complement node 
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220 has a state that corresponds to the complement of the data bit stored within the storage 
circuit (i.e.. /D). The data node 21 8 is coupled to bit line 257 (B) via access enable transistor 229 
and the complement node 220 and is coupled to bit line 259 (/B) via access enable transistor 231. 
Control terminals (i.e.. gates of the MOS transistors depicted in Figure 3) of tiie access enable 
transistors 229 and 231 are coupled to the word line, WLl, so that, when the word line is 
activated, the storage circuit is accessible for read and write operations. 
[00151 The compare circuit 207 includes transistors 221 and 225 coupled in series between the 
match line 251 , and ground, and transistors 223 and 227 also coupled in series between the 
match line 251, and ground. Gate terminals of transistors 227 and 225 are coupled to the 
compare lines 253 and 255 (CL and /CL), respectively, and gate terminals of transistors 221 and 
223 are coupled to the data node 21 8 and complement node 220 of the storage circuit, 
respectively. By this arrangement, if a comparand bit, C, presented on the compare lines 253, 
255 (i.e., in complementary fomi, C and /C) does not match the data bit stored in the storage 
circuit 205, then either transistors 221 and 225 will both be switched on (i.e., D=l, /C=l) or 
transistors 223 and 227 will both be switched on (i.e., /D=l . C=l) to discharge the match line 
25 1 1 (the match line 251 . being otherwise pulled up to a logic high state by match line precharge 
circuit 242 (MPC)). Conversely, if the comparand bit do^ match the data bit, then at least one 
transistor in each transistor pair 22 1/225 and 223/227 will be switched off so that, unless another 
CAM cell in the same row pulls the match line down, the match line remains in the pre-charged 
condition (i.e., unless another CAM cell within the same row pulls the match line down in 
response to a mismatch condition). Other compare circuits may be used in alternative 
embodiments. 

[0016] The latching sense amplifier 233 includes latching transistors 235 and 237, and an 
enable transistor 239. The latching transistors 235 and 237 are cross-coupled with the gate 

Atty. Docket No. NLMI.P133 "14- 



tenninal of transistor 235 coupled to the drair. terminal of transistor 237, and the gate terminal of 
transistor 237 coupled to the drain terminal of transistor 235. drain terminals of transistors 
235 and 237 (and therefore the gate terminals of transistors 237 and 235, respectively) are 
coupled to the bit lines 257 and 259. respectively. The source terminals of the latching 
transistors 235 and 237 are coupled in common to the drain terminal of the enable transistor 239. 
The source of the enable transistor 239 is coupled to ground, and the gate terminal of the enable 
transistor 239 is coupled to a strobe line to receive a sense amp strobe signal 275 (SAS). 
[0017] Figure 4 is a timing diagram showing exemplary control and data signals during a read 
operation within the sense amplifier 233 and a CAM cell 201 of Figure 3. Initially, at time Tl, 
word line 261 . (i.e., WLO is activated to enable access to the storage elements within the 
corresponding row of CAM cells. Referring to CAM cell 201. access transistors 229 and 231 
are switched on in response to assertion of word line 261 thereby forming a signal path between 
the data node 21 8 and bit line 257, and a corresponding signal path between the complement 
node 220 andbit line 259. Assuming that a logic '0' data value is stored in the storage circuit 
205, the data node 218 will be low and so that a pull-down current will begin to flow from the 
precharged bit Une 257 (i.e., the bit lines 257 and 259 being precharged by precharge circuits 271 
and 273, respectively) through access transistor 229 and through transistor 213. As shown in 
Figure 4, the bit line 257 (B) is initially pulled down by approximately 1 OOmV (other pull-down 
levels may be generated in alternative embodiments). During this time, a voltage of 
approximately 200mV develops at the data node 218 due to the pull-down current flowing 
through transistor 213 (different voltage drops may develop across transistor 213 in alternative 
embodiments). 

100181 At time T2, the sense amp strobe signal 275 is asserted to enable a sense operation 
,vithin the latching sense amplifier 233. Because the voltage on bit line 259 (/B) is higher than 
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the voltage on bit line 257 (B), transistor 235 is biased to conduct more current than transistor 
237, so that bit line 257 (B) is further discharged, causing the voltage at the gate of transistor 237 
to drop further, switching transistor 237 toward an increasingly non-conducting state and, by 
operation of precharge circuit 273, causing the voltage at the gate of transistor 235 to rise. By 
this regenerative operation, shortly after the sense amp strobe signal 275 is asserted, transistor 
235 is fiilly switched on so that bit line 257 is discharged substantially to ground level, and 
transistor 237 is fully switched off (i.e., conducting negligible current) so that bit line 259 
remains charged to a supply voltage level. The latching sense amplifier 233 remains latched in 
this state so long as the sense enable signal 275 is asserted. As shown graphically in Figure 4, 
the initial differential voltage level on the bit lines 257 and 259 (i.e., B and /B) is reinforced by 
the operation of the latching sense amplifier 233, with the more discharged one of the bit lines 
(B) being pulled down and latched at a logic low level (e.g., at or near the ground potential), and 
the other, more charged bit line (/B) being latched at a logic high level (e.g., at or near the supply 
voltage potential). Thus, both bit lines 257 and 259 are effectively latched at differential rail-to- 
rail levels (e.g., supply and ground voltage levels) and therefore the bit lines themselves may be 
used to drive downstream logic gates such as inverters, NAND gates and the like. 
[00191 Still referring to Figures 3 and 4, as the latching sense amplifier 233 latches the bit lines 
257 and 259 at logic low and high levels, the pull-down current (Ipd) flowing through access- 
enable transistor 229 and transistor 213 drops to zero (or a negligible value), thereby reducing 
the voltage at data node 218 to a level at or near ground. Consequently, the sub-threshold 
leakage that occurs throughout the data read operation in the prior art circuit of Figure 1 is 
reduced to a negligible level by the latching operation of the latching sense amplifier 233, 
enabling a compare operation to be executed concurrently with the data read operation (i.e., as 
indicated by compare strobe signal, CS) without false mismatch due to sub-threshold leakage. 

Atty. Docket No. NLMI.P133 -16- 



10020] Reflecting on the operation of the latching sense amplifier 233. it can be seen that the 
latching sense amplifier affects the state of the bit lines 257 and 259 themselves, driving the bit 
lir^es to the desired logic level voltages, so long as an initial differential voltage greater than the 
Vt mismatch of transistors 235 and 237 is developed on the bit lines 257 and 259. By contrast, 
the prior-art sense amplifier 141 of Figure 1 presents a high impedance input to the bit lines, 
effectively isolating the bit lines fix,m the sense amplifier output. Consequently, while the 
latching sense amplifier 233 of Figure 3 reinforces the differential voltage on the bit lines 257 
and 259, driving the bit lines to logic levels, the prior-art sense amplifier 141 requires that a 
sufficiently high differential voltage be developed to enable the multiple gain stages to generate a 
logic level output. For example, assuming that, after taking pass gate and junction losses into 
account, a 40 mV differential signal is generated at the input of the prior-art sense amplifier 141 
and also at the input of the latching sense amplifier 233 of Figure 3. Assume fiarther that the 
input transistors of the prior-art sense amplifier 141 and the latching transistors 235 and 237 of 
the latching sense amplifier each exhibita30mVVxmismatch.Inthat case, the prior-art sense 

amplifier 141 will generate a first stage signal of approximately 30 mV (i.e., 3*(40mV.30mV)), 
and a second stage signal of approximately 90mV; a signal level insufficient to drive a logic- 
level inverter or other logic gate. By contrast, the latching sense ampUfier 233 will respond to 
the effective lOmV differential (i.e., 40mV signal less 30mV Vt mismatch) by latching the lower 
level signal atalogiclowlevel(e.g., at or near ground) and latching the higher levd 
logicWghlevel(e.g., at ornear the supply voltage). T1.US, in addition to enabling simul^^^^^ 

read and compare operation without the interference that plagues the prior-art sense amplifier 
141, the latching sense amplifier 233 of Figure 3 provides considerable additional headroom as 
differential signal margin is lost to shrinking process geometries and supply voltages. Also, as 
shown in the embodiment of Figure 3, the latching sense amplifier 23 3 may be implemented with 
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substantially fewer transistors than the multi-stages of the prior-art sense amplifier 141. 
potentially reducing die area consumed by a bank of such sense amplifiers, or multiple banks of 
such sense amplifiers as in the case of a CAM device having multiple CAM arrays. 
[00211 At this point it bears noting that, while sense amplifiers of the type shown in Figure 3 
have conventionally been used in dynamic random access memories (DRAMs). such sense 
amplifiers have a number of undesirable characteristics when used with memories having static 
storage elements such as static random access memories (SRAMs). First, the latching sense 
amplifier 233 of Figure 3 tends to operate more slowly than the prior art sense amplifier 141 of 
Figure 1, primarily because of the additional time required to fiiUy discharge and re-charge the 
relatively high-capacitance bit lines 257 and 259. For most SRAM applications, this is a major 
drawback, as one of the key advantages of SRAM as compared to DRAM is low-latency, rapid 
access. In CAM applications, by contrast, compare throughput is often the most important 
performance metric, so that exchanging access time for the ability to concurrently perform 
reUable read and compare operations is a desirable in many applications. 
[00221 Another undesirable characteristic of the latching sense amplifier 233 of Figure 3 is the 
potential for reading an incorrect data state. Referring to Figure 5, shortly after the word line, 
WL, for a given CAM row is activated, signals on the bit lines B and /B may each exhibit some 
perturbation as shown at 321 . If the sense amp strobe signal (SAS) is asserted at time TSi, (i.e., 
as shown by dashed line 323) before the bit line states settle to proper differential levels, the 
latching sense amplifier 233 may latch an incorrect data state as shown at 322 (i.e., bit line /B is 
pulled low due to its initial more negative state than bit line B). Worse, because the incorrect 
data state is latched on the bit lines themselves, the data state stored in the CAM cell itself will 
be flipped, thereby corrupting the content of the corresponding CAM row. While this problem 
may be avoided by carefial timing of the sense amp strobe signal relative to the word line 
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activation (e.g., asserting at time TS,, a predetermined time after assertion of the word line. WL, 
and after the bit lines B and /B have settled to proper differential levels), this additional timing 
concern warrants additional design and verification effort to ensure against data misreads and 
storage corruption. 

no^o WritP with Bit Lin e nrive Assistan ce from L atching Sense AmpUfier 
[0023] Figure 6 illustrates an embodiment of a write driver 340 that may be used in 
combination with the latching sense amplifier 233 of Figure 3. Tkc write driver 340 includes a 
first pair of transistors 341 and 345 coupled in series between bit line 257 and ground, and a 
second pair of transistors 343 and 347 coupled in series between bit line 259 and ground. A 
write enable signal 348 (WE) is coupled to gate terminals of transistors 345 and 347, and a 
complement write data signal 346 (/WD) is coupled to a gate terminal of transistor 341 and, via 
inverter 349, to a gate terminal of transistor 343. By this arrangement, when the write data value 
is a logic '1' (i.e.. complement write data signal 346 is low), transistor 341 is switched off and 
transistor 343 is switched on and, conversely, when the write data value is a logic '0', transistor 
341 is switched on and transistor 343 is switched off. When the write enable signal is asserted, 
transistors 345 and 347 are switched on to form a path to ground through whichever of 
transistors 341 and 343 is switched on, discharging the corresponding bit line. When the bit line 
is sufficiently discharged, the state of the static storage circuit (e.g., circuit 205 of Figure 3) wiU 
be flipped or preserved to store tiie write data value, depending on whether the write data value is 
different from the previously stored value. 

[0024] In one embodiment, tiie latching sense amplifier 233 is used to reinforce the data states 
driven onto the bit lines 257 and 259 by the write driver 340, thereby permitting the write driver 
340 to be implemented with substantially smaller drive ti-ansistors (i.e., transistors 341, 343, 345, 
and/or 347) relative to drive transistors conventionally used to drive the bit lines of a CAM 
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device. Figure 7 illustrates exemplary control signal timing and corresponding bit line states in 
such an embodiment. Initially, at time Tl, the write enable signal, WE, is asserted to enable the 
write driver 340 to pull down one or the other of the bit lines (B and /B). A predetermined time 
later, at T2, the sense amp strobe signal 275 (SAS) is asserted to switch on transistor 239 and 
enable the latching operation of cross-coupled transistors 235 and 237 of the latching sense 
amplifier 233. As in a data read operation, the latching sense amplifier 233 drives the more 
discharged one of the bit lines 257 and 259 to a level at or near ground as shown at 359, and 
enables the more charged one of the bit lines (i.e., charged by precharge circuit 271 or 273) to 
remain charged at or near the supply voltage level. At time T3, after the latching sense amplifier 
has driven the bit lines to full-rail differential levels, a word line, WL, is activated to enable the 
write data value reflected by the states of bit lines 257 and 259 to be stored within the 
corresponding CAM cell. 

[0025] Still referring to Figure 7, it can be seen that the two-stage bit line discharge operation 
(i.e., first pulling down one of the bit lines in the write driver 340, then in both the write driver 
340 and latching sense amplifier 233) enables more rapid discharge of the low-going bit line than 
if the write driver 340 alone was used. In an alternative embodiment, the same rapid discharge 
rate may be achieved by providing a stronger transistor pull-down in the write driver 340 (e.g., 
by increasing the size of transistors 341, 343, 345 and/or 347) without enabling the latching 
operation of the latching sense amplifier 233. Also, in other embodiments, other write driver 

circuits may be used to drive the bit lines 257 and 259 alone or in combination with the latching 

sense ampUfier 233. 

Multistage Sense Amplifier Arrangement 

[0026] Figure 8 Ulustrates a column of CAM cells arranged in multiple sense groups 401 r 

40lK and corresponding multi-stage sense amplifier circuitry according to an embodiment of the 
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invention. Each of the K sense groups 401 r401K (SG.-SGk) includes a pair of group bit lines 
403, 405 (GB and /GB), a group sense amplifier 410 (GSA), a bit line precharge circuit 408 (i.e., 
to precharge the group bit lines), and a set of CAM cells, C1-C(M). Each of the M CAM cells is 
coupled to the group bit lines 403 and 405, to a respective one of row word lines RWLi-RWLm, 
and to a respective match line (not shown in Figure 8). A pair of compare lines 407 and 409 (CL 
and /CL) are coupled in common to each of the CAM cells. C1-C(M), within each of the sense 
groups 401 ,-401k. The group sense amplifier 410 is a latching sense amplifier having transistors 
431, 433 and 435 coupled to one another and to the group bit lines 403 and 405 in the same 
mamier as the transistors 235, 237 and 239 are coupled to one another and to bit lines 257 and 
259 in the embodiment of Figure 3. Thus, each sense group 401 is implemented in essentially 
the same manner as the column of CAM cells 201r20lN and latching sense amplifier 233 
described in reference to Figure 3, except that, assuming that the aggregate number of CAM cells 
in all the sense groups 401 is equal to the number of CAM cells per column in the embodiment 
of Figure 3 (i.e., N=M x K), the total number of CAM cells coupled to the group bit lines 403 
and 405 is reduced by a factor of (1/K) relative to the number of CAM cells coupled to bit lines 
257 and 259, and the group bit lines 403 and 405 are correspondingly shortened relative to the 
lengths of bit lines 257 and 259. By reducing the number of CAM cells coupled to a given pair 
of bit lines, pass gate leakage and junction leakage are correspondingly reduced, thereby 
providing additional signaling headroom on the bit lines for next generation processes and 
supply voltages. 

10027] Still referring to Figure 8, a common pair of compare lines is coupled to each of the M 
CAM cells in each of the sense groups 401 so that compare operations are carried out 
simultaneously throughout the entire CAM array. Also, the group bit lines 403, 405 of each 
sense group 401 are coupled, via respective group access transistors 416 and 418, to a pair of 
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colunm bit lines 41 1 and 413 (CB and /CB). A latching sense amplifier 419. referred to herein as 
the colunm sense amplifier.is coupled to the colunm bit lines411 and 413 as is a write driver 
462. The column sense amplifier 419 includes latching transistors 421 and 423, and enable 
transistor 425 coupled to one another and to the column bit lines 41 1 and 4 1 3 in the same manner 
as transistors 235, 237 and 239 are coupled to one another and to bit lines 257 and 259. 
[00281 Figure 9 is a timing diagram showing exemplary control and data signals during a read 
operation within the circuitry of Figure 8. Initially, at time Tl, a row word line, RWL, is 
asserted to enable access to one of the CAM cells within a selected one of sense groups 401 
401k. As discussed below, the row word lines within the various sense groups 401 may be gated 
according to a group address so that a word line in only one sense group 40 1 is activated during a 
given read operation. Alternatively, corresponding row word lines in all the sense groups 401 
may be activated during each read operation. As shown at 464, the states of the group bit lines 
GB and /GB within a selected sense group begm to diverge according to the value stored within 
the CAM cell selected for access by the row word line. At time T2, a group sense amp strobe 
signal, GSAS (i.e., signal 451 in Figure 8), is asserted to enable a sense operation within the 
group sense amplifier 410. In response, the group sense amplifier 410 amplifies the differential 
signal levels on the group bit lines in the manner discussed in reference to Figures 3 and 4, 
thereby establishing rail-to-rail differential signal levels on the group bit lines GB and /GB as 
shown at 466. At time T3, the group word line, GWL, for the selected sense group is activated 
to form a path between the column bit lines CB and /CB and the group bit lines for the sense 
group. Consequently, as shown at 468, the signal levels on the column bit lines CB and /CB 
begin to differentiate (i.e., the column bit lines initially being pulled up by pull-up circuits 41 5 
and 417) until, at time T4, a column sense amp strobe signal, CSAS (signal 453 in Figure 8), is 
asserted to enable a sense operation within the column sense amplifier 41 9. The column sense 
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amplifier 419 responds to assertion of the column sense amp strobe signal by amplifying the 
differential signal levels on the column bit lines, driving the column bit line states to the rail-to- 
rail differential potential shown at 470. 

[00291 Still referring to Figures 8 and 9, in one embodiment, the transistors used in the group 
sense amplifiers 41 0 are smaller than corresponding transistors 235, 237 and 239 in the latching 
sense amplifier of Figure 3, as substantially shorter bit lines are being discharged. Consequently, 
when the group word line is activated to form a signal path between the group bit lines 403, 405 
and the column bit lines411,413, the pulled-up column bitlines411 and 413 act to pull up the 
low-level one of the group bit lines as shown at 471 of Figure 9. The voltage rise on the low- 
level group bit line may vary in amplitude and duration according to the strength of the group 
sense amplifier 410. At time T4, when the column sense amplifier is activated, the column sense 
amplifier drives the low-level column and group bit lines to a substantially discharged state (i.e.. 
at or near ground) thereby ensuring that the voltage on the data node or complement node of the 
CAM cell being read is at or near ground for the duration of the read operation. In an alternative 
embodiment, the transistors within the group sense amplifiers 410 (i.e., transistors 431, 433, 435) 
may be large enough to pull down both the group bit lines 403 and 405 and. after activation of 
the group word line (GWL). the column bit lines 411 and 413. In such an embodiment, the 
column sense amplifier 419 may be omitted. 

[0030] Figure 10 is a timing diagram showing exemplary control and data signals during a 
write operation within the circuitry of Figure 8. Initially, at time Tl. the write enable signal. WE 
(i.e., signal 348 in Figure 8) is asserted to enable the write driver 462 to drive the column bit 
Unes CB and /CB to different states as shown at 474 (i.e., in response to a write data value 346). 
At time T2, the column sense amplifier is activated by assertion of the CSAS signal (signal 453 
in Figure 8) to reinforce the operation of the write driver 462, driving the column bit lines to rail- 
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to-rail differential levels as shown at 476. At time T3, an address-selected group word line. 
GWL, is activated to enable the colunm bit lines, CB and /CB. to drive the group bit lines, GB 
and /GB as shown at 478. At time T4, an address-selected row word line, RWL, is activated to 
enable the state of the group bit lines to be stored within the corresponding CAM cell. 
[0031] Still referring to Figures 8 and 10. in an alternative embodiment, after the group word 
line is activated, the group sense amplifier 410 for the selected sense group may be activated 
(i.e., by assertion of the group sense amp strobe signal 451) to more rapidly drive the group bit 
lines to rail-to-rail differential states. Assertion of the group sense amp strobe signal 451 may 
also be used to counteract the initial Vt drop across group access transistors 416 and 418, as the 
group sense amplifier 410 will drive the group bit lines 403 and 405 to rail-to-rail differential 
potentials. Alternatively, or additionally, the group access transistors 416 and 418 may be 
replaced by CMOS (complementary MOS) pass gates to avoid the Vt drop. Also, in an 
alternative embodiment, the group sense amp strobe signal 451 may be asserted instead of the 
column sense amp strobe signal 453 to effect a write operation (e.g., asserting the group sense 
amp strobe signal 451 after assertion of the write enable signal 348 and activation of the group 
word line (GWL)). In another alternative embodiment, the write driver 462 alone may include 
drive transistors having sufficient strength to drive the column bit lines 41 1 , 41 3 and group bit 
lines 403, 405 to tiie desired states. 

[00321 Figure 1 1 illustrates an embodiment of an address decoder 490 that may be used with a 
CAM array having CAM cells arranged in sense groups as shown in Figure 8. The address 
decoder 490 includes a group address decoder 493, sub-group address decoder 495 and group 
enable circuits 507,-507k. The group address decoder 493 and sub-group address decoder 495 
are coupled to receive respective segments 492 and 494 of an address value 491. T^e address 
value 491 may include additional segments of one or more bits that are used, for example, to 
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select one of multiple CAM arrays within a CAM device or a subset of a row of CAM cells 
within a given CAM array (e.g., if multiple, individually accessible CAM words or multiple 
segments of a single CAM word are stored within a row of a CAM array). Address segment 492 
is referred to herein as the group address segment, and includes log^CK) bits that correspond to 
(i.e., decode to) one of the K group word lines, GWL.-GWLk that extend across the CAM array. 
Shnilarly, address segment 494 is referred to herein as the sub-group address segment, and 
includes log2(M) bits that correspond to one of the M row word lines that extends across each 
row of sense groups of the CAM array. In the embodiment of Figure 1 1, the group address 
segment 492 constitutes the most significant bits of the address 491, and the sub-group address 
segment 494 constitutes the least most significant bits of the address 491. In alternative 
embodiments, the group address segment and sub-group address segment may be ordered in any 
significance relative to one another and to other bits of the address 491. 
[0033] In the embodiment of Figure 1 1, the group address decoder 493 activates a group- 
address-indicated one of group enable lines 497,.497k, with each of the group enable lines 497 
being coupled to a respective one of group word lines GWL,-GWLk via a respective group pass 
gate 505, - 505k. Gate tenninals of the pass gates 505 are coupled in common so that the pass 
gates are switched as a group between conducting and non-conducting states by assertion and 
deassertion of an enable signal (EN-GWL). The sub-group address decoder 495 activates a sub- 
group-address-indicated one OfM row enable lines 499,-499m in response to a decode enable 
signal 501 (EN-RWL), with the M row enable lines 499 being coupled to each of K sets ofM 
row word lines (i.e.,RWLn-RWL,M, RWL„-RWLaM, RWLk,-RWLkm) via a respective one 
of group enable circuits 507.-507k. Each of the group enable circuits 507 includes a set ofM 
pass gates 509r509M coupled respectively between the M row enable lines and the 
corresponding set ofM row word lines. Gate terminals of the pass gates 509 within each group 
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enable circuit 507 are coupled in common to a respective one of the group enable lines 497 so 
that the pass gates 509 within each group enable circuit 507 are switched as a group between 
conducting and non-conducting states according to the state of the corresponding group enable 
line 497. By this arrangement, only one row word line in one set of M row word lines is enabled 
during a given read or write operation, thereby avoiding the need for the sub-group address 
decoder 495 to drive a large number of relatively long row word lines. Also, by providing 
separate enable signals 501 and 503 to the sub-group address decoder 495 and the group pass 
gates 505, respectively, the address-selected group word line and row word line may be activated 
at different times, according to whether read or write operations are being performed. The 
enable signals 501 and 503 are generated, for example, by an instruction decoder or other control 
circuit according to host-requested access instructions or as part of an internally controlled 
access operation (e.g., reading a sequence of locations within the CAM array as part of a 
background error checking operation). 

[0034] Figure 12 illustrates a latching sense amplifier 523 and column of CAM cells 521 1- 
521n according to an alternative embodiment of the invention. The CAM cells 521 are coupled 
to pair of bit lines 524 and 526 as in the embodiment of Figure 3 (or group bit lines of a sense 
group as in the embodiment of Figure 8). The bit lines 524 and 526 are precharged to a supply 
voltage level by transistors 539 and 541 in response to i precharge signal 543 (PRE). Additional 
always-on precharge circuits may additionally or alternatively be coupled to the bit lines 524 and 
526. Match lines, word lines and compare lines are also coupled to the CAM cells 521, but not 
shown in Figure 12. The latching sense amplifier 523 includes the cross-coupled transistors 525 
and 527, and enable transistor 529 coupled in the configuration described in reference to Figure 
3, and additionally includes isolation transistors 53 1 and 533, local precharge transistors 535 and 
537, and inverter 549. Gate terminals of the local precharge transistors 535, 537, and the input 
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of inverter 549 are coupled in common to receive a sense amp pull-up control signal 545 
(SAPUC), and the output of inverter 549 is coupled to the gate terminal of the enable transistor 
529. The isolation transistors 531 and 537 are coupled to a select line to receive a sense amp bit 

select signal 547 (SABS). 

[0035] Figure 13 is a timing diagram showing exemplary control and data signals during a read 
operation within the circuit arrangement of Figure 12. Initially, at time Tl. the precharge signal 
543 (PRE) goes low for a predetermined interval to precharge the bit lines 524 and 526 to supply 
voltage levels. As discussed, an always-on precharge circuit may be provided to maintain the 
precharged state of the bit lines 524 and 526 after the precharge signal 543 is deasserted (e.g., to 
a high state). At time T2, a word line (WL) is activated to enable an address-selected one of the 
CAM cells 521 to drive a differential signal onto the bit lines 524 and 526. At time T3, the sense 
amp bit select signal 547 (SABS) goes high to switch on transistors 531 and 533, thereby 
forming a path between the gate terminals of transistors 525 and 527 and the bit lines 526 and 
524, respectively. Also, at or before time T3, the sense amp pull-up control signal 545 (SAPUC) 
goes high to switch the local precharge transistors 535 and 537 and the sense amp enable 
transistor to a substantially non-conducting state. By this operation, the voltage levels at the gate 
terminals 525 and 527 differentiate according to the states of the corresponding bit lines 526 and 
524. Consequently, when the sense amp pull-up control signal 545 is asserted (i.e., goes low) at 
time T4. the enable transistor 529 is switched on (i.e., by operation of inverter 549) to enable 
transistors 525 and 527 to latch the states of the bit lines 524 and 526 at rail-to-rail differential 
potentials. At time T5, the sense amp bit select signal 547 is deasserted to isolate the latching 
sense amplifier 523 from the bit lines 524 and 526, thereby retaining the rail-to-rail differential 
potentials of the bit lines 524 and 526 at the drain terminals of transistors 525 and 527, but 
pemiitting the word line (WL) to be deasserted and the bit lines 524 and 526 to be precharged. 
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At time T6, the word line is deactivated to enable the bit lines to be precharged in preparation for 
a subsequent read or write operation. Note that the latched state of the latching sense amplifier 
523 will remain until the next rising edge of the sense amp pull signal 545. Accordingly, the 
latched nodes of the latching sense amplifier 523 (i.e., drain terminals of transistors 525 and 527) 
may be used to drive logic level circuits (e.g., for output firom the CAM device or for storage in 
other registers or circuits of the CAM device) even as the bit lines 524 and 526 are precharged in 
preparation for a subsequent data access operation. 

T gfPhin ff Sense AmpUfi^rs roupled t o T e rnary an d Quaternary CAM CeUs 
[0036] ReferringagaintoFigure3,eachoftheCAMcells201r20lNisabinaryCAMcell 
having a single storage circuit to store either a logic ' 1 ' or logic '0' value. In alternative 
embodiments, each of the CAM cells may be a ternary CAM cell or quaternary CAM cell having 
multiple storage circuits. Figure 14 illustrates an embodhnent of a ternary CAM cell 570 that 
may be used in place of the binary CAM cells 201 of Figure 3, or to implement the CAM cells 
depicted in Figures 8 and 12. The ternary CAM cell 570 includes a first and second static 
storage circuits 571 and 573, and a compare circuit formed by transistors 575, 579, 581, 583 and 
585. The first static storage circuit 571 is used to store a data value and is referred to herein as a 
data storage circuit, while the second static storage circuit 573 is used to store a mask value and 
is referred to herein as a mask storage circuit 573. Gate terminals of transistors 579 and 581 are 
coupled to the data node and complement data node of static storage circuit 571 (i.e., to receive a 
data signal and complement data signal, respectively), and gate terminals of transistors 585 and 
583 are coupled to receive a comparand bit and complement comparand bit, respectively, from a 
pair of compare lines. In one embodiment, a match line 587 is coupled directly to the drain 
terminals of transistors 583 and 585 (i.e., as in the binary CAM cell of Figure 3), and transistor 
575, referred to herein as a mask transistor, is coupled between the source terminals of transistors 
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579 and 581 and ground. A gate terminal of the mask transistor 575 is coupled to a complement 
node of the mask storage circdt 573 so that, when a logic 'O' value is stored in the mask storage 
circuit 573, the mask transistor 575 is switched on, thereby enabling mismatch conditions to be 
signded by the compare circuit. When a logic • 1 ' value is stored in the mask storage circuit 573, 
the mask transistor is switched off to prevent a mismatch condition ftom being signaled. Thus, 
the ternary CAM cell 570 effectively stor« a "don't care" state when the mask value is a logic 
. 1 •, as the match line is unaffected by the CAM ceU 570 regardless of whether the data value 
stored in datastorage circuit 571 matches the corresponding bit ofthe comparand value. In an 
altem^ve embodiment, the mask transistor 575may be disposed between the match line and the 

drain nodes of transistois 583 and 585. Also, in another altemaHve embodiment, the 
eomplement mask value may be logically ANDed with the comparand bits, C and /C, to generate 
the signals applied at the gate temtinals of transistors 583 and 585 (i.e., thereby preventing either 
of transistors 583 and 585 from being switched on if the mask value is alogic T). Similarly, 
the complement mask value may be logically ANDed with data value and complement data 
value to generate the signals applied at the gatetemiinals of transistors 579 and 581, 
respectively. Although the ternary CAM cell 570 is depicted as having a single compare circuit 
(i.e., hnplemented by transistors 575, 579, 581, 583 and 585) coupled to a single match line 587, 
the data and mask storage circuits 571 and 573 may alternately be coupled to multiple compare 
circuits coupled respectively to multiple mw match lines, thereby enablmg simultaneous or 
pipelined comparisons with multiple comparand bits and enabling the match results to be 
signaled simultaneously or in pipelined fashion on the multiple row match Imes. 
(00371 Figure 1 5 iUustrates an embodiment of a, uatentary CAM cell 600 that may be used m 

place of the bmary CAM cell 201 of Figure 3, or to implement the CAM cells depicted in 
Figures 8 and 12. n,e quaternary CAM cell 600 includes a pair of storage elements 601 and 603 
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u, s.or= X and Y bits. resp«tively, of a quaternary data value (i.e., four-state data value). In one 
embodiment, encoding circuitiy is used to convert data and mask values into X and Y bits of a 
quaternary value according to the following logic table: 



Mask 


Data 


X 


Y 


0 


0 1 


0 


1 


0 1 


1 


1 


0 


1 


0 


0 


0 


1 


1 


0 


0 



Table 1 

[0038] Note that when the input mask value is zero (i.e., a non-masking state), the X and Y bits 
of the quaternary data value correspond to the databit and complement databit (i.e., the two 
possible states of a binary CAM cell), with the X bit being supplied to the gate terminal of 
transistor 605 and the Y bit being provided to the gate terminal of transistor 607. 
Complementary comparand signals, /C and C, are provided to the gate termmals of transistors 
609 and 611 , respectively, so that the quaternary CAM cell 600 operates like a binary CAM cell 
when the input mask yalue is zero. By contrast, when the input mask value is a logic '1', logic 
'0' values are stored m both the X and Y storage elements 601 and 603. Consequently, 
transistors 605 and 607 are both switched off, ensuring that the quaternary CAM cell 600 will 
not affect the state of the match line 613 regardless of the state of the comparand signals. The 
encoding circuitry for performing the data/mask-to-XA^ conversion set forth above in Table 1 
above may beprovidedinawrite driver circuit(e.g., circuit 340ofFigure 6) or elsewh^e in the 

write data path. Also, the encoding circuitry may be omitted, and the encoding performed 
outside the CAM device. In another embodiment, logic ' 1 ' values may be stored in each of the 
X and Y storage elements 601 and 603 to establish an "always match" state within the quaternary 
CAM cell 600. The "always match" state may be useful, for example, in array testing and may 
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be controlled by a device mode selection (e.g.. a mode bit stored within a configuration circuit 
within tiie CAM device) and/or a host instruction. As with the data and mask storage circuits of 
Figure 14. the X and Y storage circuits 601 and 603 may be coupled to multiple compare circuits 
to enable the quaternary data value stored therein to be compared simultaneously or in pipelined 
fashion with multiple comparand bits and the match results signaled simultaneously or in 
pipelined fashion on multiple row match lines. 

[00391 Figure 1 6 illustrates a dual bit line arrangement for reading and writing data to a 
column of ternary or quaternary CAM cells 631,-631n according to an embodiment of the 
invention. The column of CAM cells 631 ,-631>, form part of a larger CAM array, and each 
include two storage elements 635 and 637 (i.e., SI and S2) which may be used to store mask and 
data bits of a ternary CAM cell or X and Y bits of a quaternary CAM cell. Storage element 635 
of ^h CAM cell is coupled to a first pair of bit lines 661 and 663 (BLl and /BLl) via pass 
gates 639 and 641, and storage element 637 is coupled to a second pair of bit lines 665 and 667 
(BL2 and /BL2) via pass gates 643 and 645. Bit Unes 661 and 663 are coupled to a first latching 
sense amplifier 651 (SAl) and write driver 655 (WDl), and bit lines BL2 and /BL2 are similarly 
coupled to a second latching sense amplifier 653 (SA2) and write driver 657 (WD2). In one 
embodiment, each of the latching sense amplifiers 651 and 653 is implemented in the same 
manner as the latching sense amplifier 233 of Figure 6, and each of the write drivers 655 and 657 
are implemented in the same manner as the write driver 340 of Figure 6. Alternatively, the 
latching sense amplifier 523 of Figure 12 may be used to implement the sense amplifiers 651 and 
653, and other write driver embodiments may be used to implement the write drivers 655 and 
657. Also, each pair of bit lines 661/663 and 665/667 and con-esponding latching sense 
amplifier 651/653 may form a sense group that is coupled, along with other sense groups, to a 
pair of column bit lines as discussed in reference to Figure 8. 
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100401 Still referring to Figure 16, a single word line (i.e., one of word lines 648,-648n) is 
coupled to each row of CAM cells and, when activated, enables both storage elements 635 and 
637 to be accessed simultaneously from each CAM cell of the corresponding row. Alternatively, 
one word line may be coupled to the storage elements 635 within the row and another word line 
may be coupled to the storage elements 637 within the row, thereby enabling only the constituent 
bits of a CAM word to be accessed, or only the constituent mask bits of a mask word to be 
accessed, or the constituent bits of both the CAM word and mask word to be accessed (all tiie X- 
bits, Y-bits or botii X- and Y:bits may be accessed in the case of quaternary CAM cells). In such 
an embodiment, a single pair of bit lines may be coupled to both storage elements 635 and 637 of 
each CAM cell 631, and tiie different word lines used to select which of the storage elements is 
accessed during a given read or write operation. 

Tvr«iti-Rlnck CA M ncvice with T atchin^ Sense AmpMfiers 
[00411 Figure 17 illustrates a CAM device 700 having latching sense amplifiers according to 
an embodiment of the invention. The CAM device 700 may be implemented in a dedicated 
integrated circuit (IC) device or as a portion of an IC device (or IC package) that includes other 
circuit blocks or features, such as a general or special purpose processor (e.g., network processor 
or digital signal processor), microcontroller, memory controller, and so forth, the embodiment 
of Figure 17, the CAM device 700 includes multiple independently selectable CAM blocks. 
705^705;, instruction decoder 719, address logic 711, global flag circuit 731 and global priority 
encoder 733. A number of other circuit blocks, not shown, may also be included witiiin the 
CAM device 700 including, without limitation, input/output drivers, status registers, 
configuration circuits, associated storage, and so forth. As shown in ti>e detail view of CAM 
block 705,, each of the CAM blocks 705 includes a CAM array 701, block flag circuit and block 
priority encoder, and read/write circuitry 715. The CAM array 701 may include columns of 
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CAM cells each arranged as described in reference to Figure 3, and coupled to a latching sense 
amplifier and write driver within the read/write circuit 71 5 as described in reference to Figures 3 
and 7. Alternatively, each column of CAM cells within the CAM array 701 may be organized in 
sense groups as described in reference to Figure 8, with separate group and column bit lines and 
separate group and column sense amplifiers. The column sense amplifiers may be disposed 
within the read/write circuit 71 5 along with a column write driver, and the group sense amplifiers 
disposed within the CAM array 701 adjacent the corresponding groups of CAM cells. 
[0042] Instructions such as read, write and compare instructions are issued to the CAM device 
700 by a host processor, network ptX)cessor or other control device (not shown) via an instruction 
bus 702. In the case of read and write instructions, the control device may additionally issue 
address values to the CAM device 700 via address bus 706 to specify storage locations to be 
accessed in the CAM array 701, such addresses including, for example the group address and 
sub-group address segments described in reference to Figure 1 1 . 

[0043] The instruction decoder 719 responds to instructions received via the instruction bus 
702 by signaling other circuit blocks within the CAM device 700 to perform the instructed 
operation, including issuing control signals to control the read and write operations described in 
reference to Figures 3-13. The address logic 711 receives addresses firom the address bus 706 as 
well as from address sources within the CAM device 700 including, without limitation, a next 
free address (NFA) register that indicates a highest priority, unoccupied location within a CAM 
array; a highest priority match (HPM) register that contains a match address generated after a 
compare operation within the CAM device; an error check register that contains an address of a 
storage location within a given CAM array (or all the CAM arrays) to be error-checked (e.g., for 
parity error, cyclic redundancy check error, error correction code error, or other error); and any 
other desirable source of addresses. During a read or write access to a CAM array 701 within 
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one of CAM blocks 705, the address source is selected by the address logic 71 1 in response to 
one or more control signals 742 from the instruction decoder 7 1 9 (or other control circuitry), and 
decoded by the address logic 71 1 to activate a word line (or combination of group and row word 
lines as in Figure 1 1) that corresponds to a selected row of CAM cells within the CAM array 
701 . Read and write operations are then carried out in the manner described above in reference 
to Figures 3-13. In one embodiment, a host requested read operation is carried out by activating 
an address-selected word line (or combination of group and row word lines) that extends across 
the CAM arrays in all or a portion of the CAM blocks. A block address field of the address is 
used to enable the sense amplifier circuits within one of the CAM blocks, thereby resolving the 
overall read operation to a particular row of CAM cells (or portion thereof) within a particular 
CAM block 705. During an error checking operation, the sense amplifiers within all the CAM 
blocks 705i-705j (i.e., within the read/write circuits 715) may be enabled concurrently (i.e., at 
least partly overlapping in time) to enable CAM words, mask words and corresponding error 
check values to be read from the same row location within each CAM array 701 in each CAM 
block 705, and checked for error in a respective error detection circuit 710. Because compare- 
read mterference is avoided by using the latching sense ampUfiers described in reference to 
Figure 3 (and in reference to other Figures above), such error checking operations may be carried 
out without interrupting (or interfering with) the stteam of compare operations executed within 
the CAM device 700. 

[00441 In one embodiment, the error detection circuit 710 includes a parity checking circuit 
that generates one or more parity bits based on the content of the CAM word (or mask word) 
read from the CAM array 701, then compares the generated parity bits to parity bits read from 
the CAM array 701 along with the CAM word under test. If the generated parity bits do not 
match the stored parity bits, the error may be signaled by assertion of an error signal 712 (ERR) 
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and/or by recording an error status in the status register of the CAM device 700. The address 
fiom which the CAM word in error was read may be stored in a register for later access by the 
host processor, network processor or other control device. The error detection circuit 710 may 
perform a parity error check, cyclic redundancy check, checksum check, syndrome check (e.g., a 
syndrome generated from an error correction code) or any other type of error checking operation. 
[0045] In the embodiment of Figure 17, each of the CAM blocks 705 includes a comparand 
register 703 to store a comparand value received via a data bus 704, and drive the comparand 
value onto the compare lines of the corresponding CAM array 701. Alternatively, the 
comparand register 703 may be omitted and the comparand value driven directly from the data 
bus 704 onto the compare lines. Comparand registers 703 within the different CAM blocks 705 
may be loaded with the same or different comparand values and may be enabled to drive the 
comparand values onto the compare lines of the corresponding CAM arrays 701 simultaneously, 
or in pipelined fashion. Within each CAM block 705, compare results generated within the 
CAM arrays 701 are output on match lines 725 to the block flag circuit 707 and block priority 
encoder 709. The block flag circuit 707 outputs a block match flag signal 742 to indicate 
wheflier a match has been detected in the corresponding CAM array 701, and may additionally 
output a multiple match flag signal (not shown) to indicate whether multiple matches have been 
detected wifliin the CAM array 701 . The block priority encoder 709 outputs a block index 744 
(i.e., an address value) that corresponds to the CAM array location containing flie highest priority 
CAM word determined to match the comparand value (CAM word priority may be 
programmable or fixed in different embodiments). The block match flags from each of the CAM 
blocks 705i-705j are input to a global flag circuit 73 1 which generates a device match flag signal 
732 according to whether any matches were detected in the CAM device 700. As wifli the block 
flag circuit 707, flie global flag circuit 731 may also output a device multiple match flag signal to 
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indicate whether multiple matches were detected in the CAM device 700, The block indices 74 
generated by the CAM blocks 705,-705j are input to a global priority encoder 733 which 
determines a highest priority one of the block indices 744 and outputs a corresponding device 
index 734. The device index 734 includes the highest priority one of the block indices 744 
together with a block address segment that identifies the CAM block 705 that sourced the highest 
priority block index. In one embodiment, the block priority encoder 709 within each CAM block 
705 outputs a predetermined "no-match" code when none of the match signals 725 is asserted. 
The global priority encoder 733, in turn, treats the no-match code is as having a lowest of all 
possible priorities, so that a match detected within any of the CAM blocks will result in selection 
of the corresponding block index over the no-match code. In an alternative embodiment, the 
block match flags 742 are provided to the global priority encoder 733 along with the block 
indices 744. If a block match flag 742 is deasserted (i.e., to indicate that no match was detected 
within the corresponding CAM block 705), then the block index 744 fi-om the same CAM block 
705 is eliminated as a source of the block index portion of the device index 734. 
[0046] In one embodiment, each device index 734 (or each device index for which the 
corresponding device match flag 732 is asserted) is supplied to the address logic 71 1 and used to 
read the corresponding CAM word fi-om the indicated storage location (i.e., firom a specified row 
of the CAM array 701 within a specified one of CAM blocks 705,-705j) for error checking 
purposes. If an error is detected by the error detection circuit 710, then a qualifying error signal 
712 is output fi-om the CAM device along with the device index 734, to inform the host 
processor, network processor or other control device that the device index (i.e., the match 
address) resulted from a match with a corrupted CAM word. Although output latency is 
increased by such error checking, the error checking operation may be executed concurrently 



Atty. Docket No. NLMI.P133 



-36- 



with a subsequent compare operation, so that compare and error checking operations are 
effectively pipelined to maintain the compare throughput of the CAM device. 
[00471 Although the CAM device of Figure 17 includes multiple CAM blocks, this is not 
required. In alternative embodiments, CAM devices having single CAM blocks may include the 
CAM cell, sense amplifier and address decoder arrangements described in reference to Figures 3- 
13. Also, in one embodiment, the data bus 704 is used to transmit write and read data to and 
from the CAM device 700 (i.e., in addition to delivering comparand values) and is therefore 
coupled to the read/write circuit 715 within each CAM block. In alternative embodiments, one 
or more of the address, instruction and data buses may be eliminated and the corresponding 
signals time multiplexed onto the remaining bus or buses. 

[00481 The section headings provided in this detailed description are for convenience of 
reference only, and in no way define, limit, construe or describe the scope or extent of such 
sections. Also, while the invention has been described with reference to specific embodiments 
thereof, it will be evident that various modifications and changes may be made thereto without 
departing from the broader spirit and scope of the invention. The specification and drawings are, 
accordingly, to be regarded in an illustrative rather than restrictive sense. 
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